智能论文笔记

Interpretable Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection

Debottam Dutta , Debarpan Bhattacharya , Sriram Ganapathy , Amir H. Poorjam , Deepak Mittal , Maneesh Singh

分类：机器学习

2022-06-27

在本文中，我们描述了一种表示音频信号的表示方法，以实现COVID-19检测任务。将原始音频样品用1D卷积过滤器进行处理，这些过滤器被参数化为余弦调制的高斯函数。这些内核的选择允许将滤纸解释为光滑的带通滤波器。过滤后的输出汇总，对数压缩并用于基于自我注意的相关加权机制。相关权重强调了时间频分解的关键区域，这对于下游任务很重要。该模型的后续层由复发架构组成，模型经过训练，以执行COVID-19检测任务。在我们对COSWARA数据集的实验中，我们表明，所提出的模型在基线系统以及其他表示学习方法上实现了显着的性能改进。此外，提出的方法被证明适用于语音和呼吸信号以及从较大的数据集中转移学习。

translated by 谷歌翻译

Analyzing the impact of SARS-CoV-2 variants on respiratory sound signals

Debarpan Bhattacharya , Debottam Dutta , Neeraj Kumar Sharma , Srikanth Raj Chetupalli , Pravin Mote , Sriram Ganapathy , Chandrakiran C , Sahiti Nori , Suhail K K , Sadhana Gonuguntla

分类：机器学习

2022-06-24

COVID-19导致与不同的SARS-COV-2变体相关的多种感染波。研究报告了这些变体对患者呼吸健康的影响不同。我们探索从COVID-19受试者收集的声学信号是否显示出可区分的声学模式，这表明有可能预测潜在的病毒变体。我们分析了从三个主题库中收集的COSWARA数据集，即i）健康，ii）在三角洲变体占主导地位期间记录的covid-199受试者，以及III）来自Omicron Expear中记录的COVID-19的数据。我们的发现表明，咳嗽，呼吸和语音等多种声音类别表明，在将COVID-19与Omicron和Delta变体进行比较时，声音特征差异很大。在曲线下，分类区域大大超过了被Omicron感染的受试者与三角洲感染者的机会。使用来自多个声音类别的得分融合，我们在95％的特异性下获得了89％和52.4％的敏感性的区域。此外，使用分层三类方法将声学数据分类为健康和共同-19阳性，并将进一步的COVID受试者分为三角洲和Omicron变体，从而提供了高水平的3类分类精度。这些结果提出了设计基于声音的COVID-19诊断方法的新方法。

translated by 谷歌翻译

Svadhyaya system for the Second Diagnosing COVID-19 using Acoustics Challenge 2021

Deepak Mittal , Amir H. Poorjam , Debottam Dutta , Debarpan Bhattacharya , Zemin Yu , Sriram Ganapathy , Maneesh Singh

分类：机器学习

2022-06-11

该报告描述了用于在第二次DICOVA挑战中使用三种不同的声学模态（即语音，呼吸和咳嗽）来检测COVID-19阳性的系统。所提出的系统基于4种不同方法的组合，每种方法都集中在问题的一个方面上，并在呼吸，咳嗽和语音轨道上分别达到86.41、77.60和84.55的盲试AUC，并且这三个轨道的融合中的AUC为85.37。

translated by 谷歌翻译

Coswara: A website application enabling COVID-19 screening by analysing respiratory sound samples and health symptoms

Debarpan Bhattacharya , Debottam Dutta , Neeraj Kumar Sharma , Srikanth Raj Chetupalli , Pravin Mote , Sriram Ganapathy , Chandrakiran C , Sahiti Nori , Suhail K K , Sadhana Gonuguntla

分类：机器学习

2022-06-09

COVID-19大流行已经加快了关于替代，快速有效的Covid-19诊断方法设计的研究。在本文中，我们描述了Coswara工具，这是一个网站应用程序，旨在通过分析呼吸声样本和健康症状来启用COVID-19检测。使用此服务的用户可以使用连接到Internet的任何设备登录到网站，提供当前的健康症状信息，并记录很少有对应于呼吸，咳嗽和语音的声音。在分析此信息上的一分钟内，网站工具将向用户输出COVID-19概率分数。随着COVID-19的大流行继续要求进行大规模和可扩展的人口水平测试，我们假设所提出的工具为此提供了潜在的解决方案。

translated by 谷歌翻译

Do I have the Knowledge to Answer? Investigating Answerability of Knowledge Base Questions

Mayur Patidar , Avinash Singh , Prayushi Faldu , Lovekesh Vig , Indrajit Bhattacharya , Mausam

分类：自然语言处理 | 人工智能

2022-12-20

When answering natural language questions over knowledge bases (KBs), incompleteness in the KB can naturally lead to many questions being unanswerable. While answerability has been explored in other QA settings, it has not been studied for QA over knowledge bases (KBQA). We first identify various forms of KB incompleteness that can result in a question being unanswerable. We then propose GrailQAbility, a new benchmark dataset, which systematically modifies GrailQA (a popular KBQA dataset) to represent all these incompleteness issues. Testing two state-of-the-art KBQA models (trained on original GrailQA as well as our GrailQAbility), we find that both models struggle to detect unanswerable questions, or sometimes detect them for the wrong reasons. Consequently, both models suffer significant loss in performance, underscoring the need for further research in making KBQA systems robust to unanswerability.

translated by 谷歌翻译

AdverSAR: Adversarial Search and Rescue via Multi-Agent Reinforcement Learning

Aowabin Rahman , Arnab Bhattacharya , Thiagarajan Ramachandran , Sayak Mukherjee , Himanshu Sharma , Ted Fujimoto , Samrat Chatterjee

分类：机器人 | 机器学习

2022-12-20

Search and Rescue (SAR) missions in remote environments often employ autonomous multi-robot systems that learn, plan, and execute a combination of local single-robot control actions, group primitives, and global mission-oriented coordination and collaboration. Often, SAR coordination strategies are manually designed by human experts who can remotely control the multi-robot system and enable semi-autonomous operations. However, in remote environments where connectivity is limited and human intervention is often not possible, decentralized collaboration strategies are needed for fully-autonomous operations. Nevertheless, decentralized coordination may be ineffective in adversarial environments due to sensor noise, actuation faults, or manipulation of inter-agent communication data. In this paper, we propose an algorithmic approach based on adversarial multi-agent reinforcement learning (MARL) that allows robots to efficiently coordinate their strategies in the presence of adversarial inter-agent communications. In our setup, the objective of the multi-robot team is to discover targets strategically in an obstacle-strewn geographical area by minimizing the average time needed to find the targets. It is assumed that the robots have no prior knowledge of the target locations, and they can interact with only a subset of neighboring robots at any time. Based on the centralized training with decentralized execution (CTDE) paradigm in MARL, we utilize a hierarchical meta-learning framework to learn dynamic team-coordination modalities and discover emergent team behavior under complex cooperative-competitive scenarios. The effectiveness of our approach is demonstrated on a collection of prototype grid-world environments with different specifications of benign and adversarial agents, target locations, and agent rewards.

translated by 谷歌翻译

Task Preferences across Languages on Community Question Answering Platforms

Sebastin Santy , Prasanta Bhattacharya , Rishabh Mehrotra

分类：自然语言处理

2022-12-18

With the steady emergence of community question answering (CQA) platforms like Quora, StackExchange, and WikiHow, users now have an unprecedented access to information on various kind of queries and tasks. Moreover, the rapid proliferation and localization of these platforms spanning geographic and linguistic boundaries offer a unique opportunity to study the task requirements and preferences of users in different socio-linguistic groups. In this study, we implement an entity-embedding model trained on a large longitudinal dataset of multi-lingual and task-oriented question-answer pairs to uncover and quantify the (i) prevalence and distribution of various online tasks across linguistic communities, and (ii) emerging and receding trends in task popularity over time in these communities. Our results show that there exists substantial variance in task preference as well as popularity trends across linguistic communities on the platform. Findings from this study will help Q&A platforms better curate and personalize content for non-English users, while also offering valuable insights to businesses looking to target non-English speaking communities online.

translated by 谷歌翻译

Multiagent Reinforcement Learning for Autonomous Routing and Pickup Problem with Adaptation to Variable Demand

Daniel Garces , Sushmita Bhattacharya , Stephanie Gil , Dimitri Bertsekas

分类：人工智能 | 机器学习 | 机器人

2022-11-28

We derive a learning framework to generate routing/pickup policies for a fleet of vehicles tasked with servicing stochastically appearing requests on a city map. We focus on policies that 1) give rise to coordination amongst the vehicles, thereby reducing wait times for servicing requests, 2) are non-myopic, considering a-priori unknown potential future requests, and 3) can adapt to changes in the underlying demand distribution. Specifically, we are interested in adapting to fluctuations of actual demand conditions in urban environments, such as on-peak vs. off-peak hours. We achieve this through a combination of (i) online play, a lookahead optimization method that improves the performance of rollout methods via an approximate policy iteration step, and (ii) an offline approximation scheme that allows for adapting to changes in the underlying demand model. In particular, we achieve adaptivity of our learned policy to different demand distributions by quantifying a region of validity using the q-valid radius of a Wasserstein Ambiguity Set. We propose a mechanism for switching the originally trained offline approximation when the current demand is outside the original validity region. In this case, we propose to use an offline architecture, trained on a historical demand model that is closer to the current demand in terms of Wasserstein distance. We learn routing and pickup policies over real taxicab requests in downtown San Francisco with high variability between on-peak and off-peak hours, demonstrating the ability of our method to adapt to real fluctuation in demand distributions. Our numerical results demonstrate that our method outperforms rollout-based reinforcement learning, as well as several benchmarks based on classical methods from the field of operations research.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Correlated Feature Aggregation by Region Helps Distinguish Aggressive from Indolent Clear Cell Renal Cell Carcinoma Subtypes on CT

Karin Stacke , Indrani Bhattacharya , Justin R. Tse , James D. Brooks , Geoffrey A. Sonn , Mirabela Rusu

分类：计算机视觉

2022-09-29

肾细胞癌（RCC）是一种常见的癌症，随着临床行为的变化。懒惰的RCC通常是低级的，没有坏死，可以在没有治疗的情况下监测。激进的RCC通常是高级的，如果未及时检测和治疗，可能会导致转移和死亡。虽然大多数肾脏癌在CT扫描中都检测到，但分级是基于侵入性活检或手术的组织学。确定对CT图像的侵略性在临床上很重要，因为它促进了风险分层和治疗计划。这项研究旨在使用机器学习方法来识别与病理学特征相关的放射学特征，以促进评估CT图像而不是组织学上的癌症侵略性。本文提出了一种新型的自动化方法，即按区域（Corrfabr）相关的特征聚集，用于通过利用放射学和相应的不对齐病理学图像之间的相关性来对透明细胞RCC进行分类。 CORRFABR由三个主要步骤组成：（1）特征聚集，其中从放射学和病理图像中提取区域级特征，（2）融合，放射学特征与病理特征相关的放射学特征在区域级别上学习，并且（3）在其中预测的地方学到的相关特征用于仅使用CT作为输入来区分侵略性和顽固的透明细胞RCC。因此，在训练过程中，Corrfabr从放射学和病理学图像中学习，但是在没有病理图像的情况下，Corrfabr将使用CORFABR将侵略性与顽固的透明细胞RCC区分开。 Corrfabr仅比放射学特征改善了分类性能，二进制分类F1分数从0.68（0.04）增加到0.73（0.03）。这证明了将病理疾病特征纳入CT图像上透明细胞RCC侵袭性的分类的潜力。

translated by 谷歌翻译